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DETAILED ACTION 

1. Claims 17-21,. 23-26, and 28-29 were considered. 

Papers Submitted 

2. It is hereby acknowledged that the following papers have been received and placed of 
record in the file: Amendment as filed 1 1 January 2007 and IDS as filed 22 February 2007. 

Information Disclosure Statement 

3. The information disclosure statement filed 22 February 2007 fails to comply with the 
provisions of 37 CFR 1.97, 1.98 and MPEP § 609 because Other Document EF Patterson et al., 
was previously cited in another IDS. It has been placed in the application file, but the 
information referred to therein has not been considered as to the merits. Applicant is advised 
that the date of any re-submission of any item of information contained in this information 
disclosure statement or the submission of any missing element(s) will be the date of submission 
for purposes of determining compliance with the requirements based on the time of filing the 
statement, including all certification requirements for statements under 37 CFR 1.97(e). See 
MPEP § 609.05(a). 

Double Patenting 

4. The nonstatutory double patenting rejection is based on a judicially created doctrine 
grounded in public policy (a policy reflected in the statute) so as to prevent the unjustified or 
improper timewise extension of the "right to exclude" granted by a patent and to prevent possible 
harassment by multiple assignees. A nonstatutory obviousness-type double patenting rejection 
is appropriate where the conflicting claims are not identical, but at least one examined 
application claim is not patentably distinct from the reference claim(s) because the examined 
application claim is either anticipated by, or would have been obvious over, the reference 
claim(s). See, e.g., In re Berg, 140 F.3d 1428, 46 USPQ2d 1226 (Fed. Cir. 1998); In re 
Goodman, 1 1 F.3d 1046, 29 USPQ2d 2010 (Fed. Cir. 1993); In re LongU 759 F.2d 887, 225 
USPQ 645 (Fed. Cir. 1985); In re Van Ornum, 686 F.2d 937, 214 USPQ 761 (CCPA 1982); In re 
Vogel, 422 F.2d 438, 164 USPQ 619 (CCPA 1970); and In re Thorington, 418 F.2d 528, 163 
USPQ 644 (CCPA 1969). 
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5. A timely filed terminal disclaimer in compliance with 37 CFR 1.321(c) or 1.321(d) may 
be used to overcome an actual or provisional rejection based on a nonstatutory double patenting 
ground provided the conflicting application or patent either is shown to be commonly owned 
with this application, or claims an invention made as a result of activities undertaken within the 
scope of a joint research agreement. 

6. Effective January 1, 1994, a registered attorney or agent of record may sign a terminal 
disclaimer. A terminal disclaimer signed by the assignee must fully comply with 37 CFR 
3.73(b). 

7. Claims 17, 20-21, 23-24, 26, and 28-29 are rejected on the ground of nonstatutory 
obviousness-type double patenting as being unpatentable over claims 1, 3, 9, and 10 of U.S. 
Patent No. 6,668,317 (herein referred to as '317) in view of Karguth, U.S. Patent Number 
6,223,277 (herein referred to as Karguth) and in further view of David K. Probst's 
"Programming, Compiling and Executing Partially-Ordered Instruction Streams on Scalable 
Shared-Memory Multiprocessors" from Proceedings of the Twenty-Seventh Annual Hawaii 
International Conference on System Sciences. 1 994 ©1994 IEEE (herein referred to as Probst). 
Table I below shows the relations between the claims of the instant application, '31 7, Karguth, 
and Probst. A person of ordinary skill in the art at the time the invention was made, and as 
taught by Karguth, would have recognized that the packed data system maximizes on-chip 
utilization of memory and obtains performance at minimum cost (Karguth column 2, lines 50-65 
"...As a result, packed data structures are attractive in these type of systems..."). Therefore, it 
would have been obvious to a person of ordinary skill in the art at the time the invention was 
made to incorporate the packed data device of Karguth in the device of £ 3 17 to maximize on-chip 
memory utilization and minimize cost. In addition, a person of ordinary skill in the art at the 
time the invention was made, and as taught by Probst, would have recognized that multithreading 
improves tolerance of latencies and increases processor utilization (Probst Section 2, paragraph 1 
"Multithreading is commonly suggested as a technique for tolerating latencies and increasing 
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processor utilization. . .")• Therefore, it wou ld have been obvious to a person of ordinary skill in 
the art at the time the invention was made to incorporate the multithreading of Probst in the 
device of Karguth to improve latency tolerance and increase processor utilization. 



Tabic I 



Instant Application 


Patent Number 6,668,317 


Claim 17 


Claims 1,3,9, 10 


A hardware-based multithreaded 
processor comprising: 


Probst Section 2, paragraph 1 "Multithreading is 
commonly suggested as a technique..." 


A nluralitv of microen pines 


Knroiijh rnlumn S lineQ 19-1^ ct another inctunpp 
of network huh and ATlVf translator ^ wonlH he 

implemented in place of ATM premises switch 8..."; 
column 3, lines 40-43 "The present invention may be 
implemented in a micro-processor architecture..."; 
and Figure 1, element 5 


Each of the microengines comprising 


A microcontrolled functional execution unit 
comprises (Claim 1): 


A context event arbiter, 


A context event arbiter, which in response to external 
flags, determines which one of a plurality of threads 
executable in the microcontrolled functional 
execution unit to promote to an execution state 
(Claim 1). 


A controller, 


A microengine controller for maintaining a plurality 
of microprogram counters, and decode logic for 
decoding instructions (Claim 1); 


A control store, 


A control store to store a microprogram(Claim 1); 


Local read and write transfer registers, 


A read transfer register bank (Claim 9); and 
A write transfer register bank, with the read and write 
transfer register banks divided into a plurality of 
windows that correspond to the number of 
microprogram counters supported in the microengine 
controller (Claim 9) 

A read transfer register bank (Claim 10); and 
A write transfer register bank, with the read and write 
transfer register banks divided into a plurality of 
banks assigned for different shared resources in the 
microengine controller (Claim 10). 
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Local general purpose registers, and an 
arithmetic logic unit (ALU), 


An arithmetic logic unit and shifter controlled by 
decoded signals produced from the microengine 
controller (Claim 3); and 

A general purpose register bank to store and obtain 
operands for the arithmetic logic unit (Claim 3). 


Each of the microengines supporting 
instructions that 


Karguth column 7, lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and 
iUL^icai upciaLiuiio upuii upciaiiUD prcbentcu 
thereto..."; column 9, lines 27-32 "...provide a five- 
bit selection code by way of which the destination 
and source registers, respectively, for the instruction 
are addressed..."; column 9, lines 58 to column 10, * 
line 9 "...contain an immediate operand value for 
use... contain a second source register..."; and Figure 
3, element 30 


perform an ALU operation on one or 
two operands, 


Karguth column 7, lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and 
joyicdi optidiions upon operanus presented 
thereto..."; column 9, lines 27-32 "...provide a five- 
bit selection code by way of which the destination 
and source registers, respectively, for the instruction 
are addressed..."; column 9, lines 58 to column 10, 
line 9 "...contain an immediate operand value for 
use... contain a second source register..."; and Figure 
3. element 30 


x>/t']jvjoii a itouii in a uco UllaliUll iCc^iMCI 

and 


jxdiguui coiumn o, lines i-zj ...aata results mat are 
to be written back into register file 24 are applied to 
shifter 34, to place the operand in the appropriate bit 
positions of the destination of one of registers..."; 
column 9, lines 27-32 "...provide a five-bit selection 
code by way of which the destination and source 
registers, respectively, for the instruction are 
auui tobou. . . , cinu i igure j 


Update ALU condition codes according 
to the result; and 


Karguth column 1 1, lines 35-51 "...the test is 
evaluated, thus resulting in a singlecycle test and 
branch instruction " — In regards to Kannith the 
test results essentially update the condition codes, 
eventhough they are not stored in a register, in order 
to determine if the condition is true or not and to 
branch accordingly. 


A local register instruction that loads 
one or more bytes, specified by a 
multiple-bit field of the instruction, 


Karguth column 7, lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and 
logical operations upon operands presented 
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within a local destination register with 
a shifted value of another operand, 


thereto..."; column 9, lines 27-32 "...provide a five- 
bit selection code by way of which the destination 
and source registers, respectively, for the instruction 
are addressed..."; column 9, lines 58 to column 10, 
line 9 "...contain an immediate operand value for 
use., .contain a second source register..."; and Figure 
3. element 30 


The field representing a mask in which 
each bit of the mask identifies a 
different byte of the destination 
register. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
lo . . . , column iu, lines iu-4o ...wrne-DacK 
upciaiiun accuiuuig iu uic picierrcu einuouimeni oi 
the present invention also shifts the output result 
from ALU 30 into the proper position within the 
result word. . .so that one or more of the byte 
locations may be written with the contents of 
writeback bus WBBUS (the remaining bits being 
masked from the write). . and Figure 3 


Claim 20 




l iic pruLcaour ui claim 1 /, wnerein me 

nPQtinati on rpcxicfpr iq 5i crpnpr£i1 nnrnr^c^ 
u^oiiiidliuii iCglaLCl lo a gCUClal puipUoC 

register. 


tvui guin column o, lines i-zj ...aata results tnat are 
iu uc wiiutii DdtK into legisiei ine are appnea to 
shifter 34, to place the operand in the appropriate bit 
positions of the destination of one of registers..."; 
column 9, lines 27-32 "...provide a five-bit selection 
code by way of which the destination and source 
registers, respectively, for the instruction are 
addressed..."; and Figure 3 


Claim 21 




1 hp nrnrpc c*~\t r\T /""lcnin 1 / a \ ?\~\ a t-^ i n flip 
i lie piUL/CooUi ui Claim i /, wiiciciii iiic 

lopfil rpcJiQfpr mQtniptinn r*r\iTTnn*QpQ flip 

lU^Cll ivglold llloLI UL-llVJl 1 CVJlll Ul loCb iiic 

destination register. 


ax// gum coiumn o, lines i-zj ...aaia results mat are 

IU UC VVIlLLCIl UdCK U1LO lcglbLcr IllC Z'r <UC appilCU IO 

shifter 34, to place the operand in the appropriate bit 
positions of the destination of one of registers..."; 
column 9, lines 27-32 . .provide a five-bit selection 
code by way of which the destination and source 
registers, respectively, for the instruction are 
addressed..."; and Figure 3 


Claim 23 




The processor of claim 1 7, wherein the 
mask is 4-bits. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
28..."; column 10, lines 10-48 "...write-back 
operation according to the preferred embodiment of 
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the present invention also shifts the output result 
l.ioin alu ju into me proper position wumn me 

1 Co LIU WU1U. . . UldL UIlC Ul lllUic Ul lilt Vy IC 

locations may be written with the contents of 
writeback bus WBBUS (the remaining bits being 
masked from the write). .." and Figure 3 - In regards 
to Karguth. the exemplary mask is 3-bits, however, 
the size of the mask field does not matter and it is 
only an exemplary embodiment. 


Claim 24 




The processor of claim 1 7, wherein the 
mask comprises a set bit indicating a 
corresponding byte in the local register 
to be loaded. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
zo... , coiumn iu, lines lu-^fo ...wnte-DacK 
upcicuiuii accuiunig iu Liic piciciTcu cmDouimeni or 

tlip nt'P^pnt invpntinn alQr\ chiftc tVip* r\ntrMit r^cult 
UlC JJ.l. CoClll IUVCULIU11 alSU ollllLb L11C UUIUUL 1 Colli I 

from ALU 30 into the proper position within the 
result word. . .so that one or more of the byte 
locations may be written with the contents of 
writeback bus WBBUS (the remaining bits being 
masked from the write)..." and Figure 3 


Claim 26 




Apparatus comprising: 




A hardware-based multithreaded 
processor comprising 


Probst Section 2, paragraph 1 "Multithreading is 
commonly suggested as a technique..." 


A nlnrjilitv nf mirrnpncrinpc 
^jL\xiciiiiy \jL niiciuciitiiiicoj 


jYtit gifu i cuuuiiii j, imes iz-io . , . anoiner insxance 
of network hub and ATM translator 5 would be 
implemented in place of ATM premises switch 8..."; 
column 3, lines 40-43 'The present invention may be 
implemented in a micro-processor architecture..."; 

tmH rMcnirf* 1 p pmpnt ^ 
(UIU 1 limine I, C1CIIICIU J 


Each of the microengines comprising 


A microcontrolled functional execution unit 
comprises (Claim 1): 


A Pnntpvt pvpnt arViitpr 

■*V CClllCAl CVvlll CU UllCl ; 


r\ conic x i cvciu aiDiieij wnicn in response 10 external 
fla^s determines which one of a nluralitv of threads 
executable in the microcontrolled functional 
execution unit to promote to an execution state 
(Claim 1). 


A controller, 


A microengine controller for maintaining a plurality . 
of microprogram counters, and decode logic for . 
decoding instructions (Claim 1); 


A control store, 


A control store to store a microprogram(Claim 1); 
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Local read and write transfer registers, 


A read transfer register bank (Claim 9); and 
A write transfer register bank, with the read and write 
transfer register banks divided into a plurality of 
windows that correspond to the number of 
microprogram counters supported in the microengine 
controller (Claim 9) 

A read transfer register bank (Claim 10); and 
A write transfer register bank, with the read and write 
transfer register banks divided into a plurality of 
banks assigned for different shared resources in the 

mi rrnpn oinp control Ipr ff^l^im 1 OA 

1 1 1 1 VJwl Ic^lilv WJIlllWlItl I v^lCliill L\JJ. 


Local general purpose registers, and an 
arithmetic logic unit (ALU), 


An arithmetic logic unit and shifter controlled by 
decoded signals produced from the microengine 
controller (Claim 3); and 

A general purpose register bank to store and obtain 
operands for the arithmetic logic unit (Claim 3). 


Each of the plurality of microengines 
including a command that causes the 
ALU to load one or more bytes, 
specified by a multiple-bit field of the 
command, within a destination register 
of a selected microengine with a shifted 
value of another one or more bytes of a 
source register, 


Karguth column 7. lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and 
logical operations upon operands presented 
thereto..."; column 9, lines 27-32 "...provide a five- 
bit selection code by way of which the destination 
and source registers, respectively, for the instruction 
are addressed..."; column 9, lines 58 to column 10, 
line 9 ..contain an immediate operand value for 
use... contain a second source register..."; and Figure 
3. element 30 

Karguth column 7, lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and 
logical operations upon operands presented 
thereto..."; column 9, lines 27-32 "...provide a five- 
bit selection code by way of which the destination 
and source registers, respectively, for the instruction 
are addressed "* column 9 lines to column 10 
line 9 ..contain an immediate operand value for 
use... contain a second source register..."; and Figure 
3, element 30 

Karguth column 8, lines 1-23 "...data results that are 
to be written back into register file 24 are applied to 
shifter 34, to place the operand in the appropriate bit 
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positions of the destination of one of registers. .."; 
column 9, lines 27-32 "...provide a five-bit selection 
code by way of which the destination and source 
registers, respectively, for the instruction are 
addressed... 5 '; and Figure 3 

Karguth column 1 1, lines 35-51 "...the test is 
evaluated, thus resulting in a singlecycle test and 
branch instruction. . - In regards to Karguth, the 
test results essentially update the condition codes, 
eventhough they are not stored in a register, in order 
to determine if the condition is true or not and to 
branch accordingly. 

Karguth column 1, lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and 
logical upeia nuns upon opeiaiius presented 
thereto..."; column 9, lines 27-32 "...provide a five- 
bit selection code by way of which the destination 
and source registers, respectively, for the instruction 
are addressed. . ."; column 9, lines 58 to column 10, 
line 9 "...contain an immediate operand value for 
use. ..contain a second source register..."; and Figure 
3, element 30 


The field representing a mask in which 
each bit of the mask identifies a 
different byte of the destination 
register. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
*o.. ( . coiumn i u, lines iu-ho . ..write- oacK 

rinfTPi f"i o n 5\Cf*r\vr\ii~\cr tr\ trip nrPTArrpH pm V*\r>H i m pnt r\-F 

ujjv^i aiiuti acuuiuuij^ lu uic picj.ciicu cin Duuirncni OI 
the present invention also shifts the output result 
from ALU 30 into the proper position within the 
result word. . .so that one or more of the byte 
locations may be written with the contents of 
writeback bus WBBUS (the remaining bits being 
masked from the write)..." and Figure 3 


Claim 28 




The apparatus of claim 26, wherein the 
mask is 4-bits. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
28..."; column 10, lines 10-48 "...write-back 
operation according to the preferred embodiment of 
the present invention also shifts the output result 
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J1U1U r\Lj\J OU UllU U1C piupfc/I pOblllun VVlLXUll LUC 

rfsiilt wnrH cn tlint nnp r»r mnrp ^"FtViP Vivfp 
1 ^iUll WUIU. , ,3U UlcU UllW Ul 111U1C VJ1 UJ.C Uy It 

locations may be written with the contents of 
writeback bus WBBUS (the remaining bits being 
masked from the write). . ." and Figure 3 - In regards 
to Karguth, the exemplary mask is 3-bits, however, 
the size of the mask field does not matter and it is 
only an exemplary embodiment. 


Claim 29 




The apparatus of claim 26, wherein the 
mask comprises a set bit indicating a 
corresponding byte in the source 
register to be loaded. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
28 "'column 10 lines 10-48" write-haek 

+*yJ ■ . . , w^rilllllll 1 W ; lllll/O 1 v ~\j ... VV 1 1 I W kJCL\* JV 

operation according to the preferred embodiment of 
the present invention also shifts the output result 
from ALU 30 into the proper position within the 
result word. . . so that one or more of the byte 
locations may be written with the contents of 
writeback bus WBBUS (the remaining bits being 
masked from the write)..." and Figure 3 



8. Claims 17-21, 23-26, and 28-29 are rejected on the ground of nonstatutory obviousness- 



type double patenting as being unpatentable over claims 1 , 2-3, and 5-6 of U.S. Patent No. 
7,191,309 (herein referred to as '309) in view of Karguth, U.S. Patent Number 6,223,277 (herein 
referred to as Karguth). Table II below shows the relations between the claims of the instant 
application, '309, and Karguth. A person of ordinary skill in the art at the time the invention was 
made, and as taught by Karguth, would have recognized that the packed data system maximizes 
on-chip utilization of memory and obtains performance at minimum cost (Karguth column 2, 
lines 50-65 "...As a result, packed data structures are attractive in these type of systems..."). 
Therefore, it would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to incorporate the packed data device of Karguth in the device of '309 to 
maximize on-chip memory utilization and minimize cost. 
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Table II 



Instant Application 


Patent Number 7,191,309 


Claim 17 


Claim 1 


A hardware-based multithreaded 
processor comprising: 


A hardware-based multithreaded processor 
comprising: 


A plurality of microengines, 


A plurality of microengines, 


Each of the microengines comprising 


Each of the microengines comprising: 


A context event arbiter, 


Context event switching logic; 


A controller 


C^ontrnllfT locnr - 


A control store, 


A control store 


Local read and writp transfer 

J — i W V* Cll 1. VU \A CXI 1 vi VV 1 1 Lv La (XI 1J ivl 

registers, 


ivtf / ^ L\ III ^Wltllllll O, llllWo 1 If . , . llltiJIVJl y UlLCildCC 

37, for writing results of the operation to parameter 
memorv 18 over busps MFTvTr) lVTFlVfA " and Ficmrp 
3, element 37 - In regards to Karguth, the memory 
interface must hold, e.g. storing them in registers, the 
data, in some way in order to transfer data to and from 
oarameter memorv without it incorrectly chanoino 
and/or influencing the rest of the system. 


Local general purpose registers, and 
an arithmetic logic unit (ALU), 


An execution box data path including an arithmetic 
logic unit (ALU) and a general purpose register set, 


Each of the microengines sunnortin? 
instructions that 


TnP AT, I ] nprforiYimo fnnptinn^ in rpQrioncp tn 

1 11^ • *■ l^j \J | l VJ I 1 1 1 1 1 ] £^ J Vll ti KJl I j 111 IfcoL/VJllSfc HJ 

instructions 


perform an ALU operation on one or 
two operands, 


One of the instructions causing the ALU to load a 
destination register with a 32-bit word formed by 
concatenating a first operand and a second operand to 
form a 64-bit result 


Deposit a result in a destination 
re&ister and 


One of the instructions causing the ALU to load a 

H pel i n Jit l Pin rpcrt ct"f*r with a ^O-Hit \x/r\ivi fnrm c±r\ \\\r 
Utol I llClUUl J 10££lc»lCI VV1U1 Cl WUIU 1U11I1CU UY 

concatenating a first operand and a second operand to 
form a 64-bit result 


Update ALU condition codes 
according to the result; and 


Karguth column 1 1, lines 35-51 "...the test is 
evaluated, thus resulting in a singlecycle test and 
branch instruction. . - In regards to Karguth, the test 
results essentially update the condition codes, 
eventhough they are not stored in a register, in order to 
determine if the condition is true or not and to branch 
accordingly. 


A local register instruction that loads 
one or more bytes, specified by a 
multiple-bit field of the instruction, 
within a local destination register 
with a shifted value of another 
operand, 


One of the instructions causing the ALU to load a 
destination register with a 32-bit word formed by 
concatenating a first operand and a second operand to 
form a 64-bit result, shifting the 64-bit result by a 
speci fied amount, and storing a lower 32-bits of the 
64-bit result. 
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The field representing a mask in 
which each bit of the mask identifies 
a different byte of the destination 
register. 


Kargu/h column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
iio . . . , L-uiuiiui iu. unco i i/*to . , . wnie~DaCiv 

oner?ition fir*pnt"rlino tn flip rwppprrpH pmKnHimpnt r\"F 
VJJJL. 1 c\ 1 aC-^VJI U 11 1 LU lllv jJlClCHCLl C1I1 UUL11111C11 I UI 

the present invention also shifts the output result from 
ALU 30 into the proper position within the result 

word so that nnp or more of thp hvte lnratintiQ mav 
be written with the contents of writeback hns WRRTIS 
(the remaining bits being masked from the write)..." 
and Figure 3 


Claim 18 


Claim 5 


The processor of claim 17, wherein 
the destination register is an absolute 
transfer register. 


The processor of claim 1 wherein the destination 
register is an absolute register name. 


Claim 19 


Claim 6 


The processor of claim 17, wherein 
the destination register is a context- 
relative transfer register. 


The processor of claim 1 wherein the destination 
register is a context relative register name. 


Claim 20 




"The nrnrPQ^nr nf* rlaim 1 7 wliprpin 

lilt |Ji U^VOOUl \Jl l/lCLIlll I/, WilClClll 

the destination recxistpr is $\ crpnpral 

11 IV UVj L111CI UUll lvclol^/i IS d gt'Ilti CXI 

purpose register. 


jvut guiti tuiuiiiii o ; lines l -zj ...aaia results xnai are 

tr\ hp \A/l*i1"f'Pn heir*!/ into rpniotpi* 'flip 7zl a r*/=» a-r*r\l i 

iu ul. wi iiLtu udtK liuu 1 eg later iiic are appneu to 
shifter 34. to place the operand in the appropriate bit 
positions of the destination of one of registers..."; 
column 9, lines 27-32 "...provide a five-bit selection 
code by way of which the destination and source 
registers, respectively, for the instruction are 
addressed..."; and Figure 3 


Claim 21 




Thp nror ps^nr of rlaim 1 7 wViprpin 
the local recnster instruction 

HIV IVJvUI 1 v ill u Ivl 1 1 ID H d V UU 1 1 

comprises the destination register. 


j\ut qui ft cui uiii ii o, liiicb i -z j .. .Uciia results mai are 

LU Uo WLUldl UalK I11LU ICglolCI 111c cue appilcQ lO 

shifter 34, to place the operand in the appropriate bit 
positions of the destination of one of registers..."; 
column 9. lines 27-32 "...provide a five-bit selection 
code by way of which the destination and source 
registers, respectively, for the instruction are 
addressed...."; and Figure 3 


Claim 23 




The processor of claim 1 7, wherein 
the mask is 4-bits. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:] 3 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
28..."; column 10, lines 10-48 "...write-back 
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operation according to the preferred embodiment of 
the present invention also shifts the output result from 

A 1 II Xll ii^t"^ tnP nrAnat* nACiti An nntniM + U\ r± tmn milt 

alu ju liuu uiL pioptr position wunin ine resun 

worn cn In at P*np cw mnrp r\r thp Vw/tf* lr\r»atirtnc mo\/ 

WU1U...0U uiaL uiiL^ ui liiuic ui li ic uy ic lUL/aiiuna may 
be written with the contents of writeback bus WBBUS 
(the remaining bits being masked from the write)..." 
and Figure 3 - in regards to Karguth, the exemplary 
mask is 3-bits, however, the size of the mask field 
does not matter and it is only an exemplary 
embodiment. 


Claim 24 




The processor of claim 17, wherein 
the mask comprises a set bit 
indicating a corresponding byte in 
the local register to be loaded. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
£o... , luiuinn tu. lints j u-ho . . . wriie-oacK 
upcidLiuii ciocuiciiii^ iu uiL picieireu emuouimeni or 
the present invention also shifts the output result from 
ALU 30 into the proper position within the result 
word... so that one or more'of the byte locations may 
be written with the contents of writeback bus WBBUS 

( thp r^m?iii1infT Kite npnin mQcL-^rl "From tVio nrrito\ " 
^Lii^ I ^iiiai l llll^ Ullo UClilg lllahNCU ilUlli U1C WllLCI... 

and Fipurp 


Claim 25 


I *a i in < 7 ii n r 1 'X 


The processor of claim 17, wherein 
the local register comnrises a context 
relative source register. 


The processor of claim 1 wherein the first operand is a 

COni P Vt-rpl PltlVP 19 -hit rpcnctpr 

The orocessor of claim 1 wherein the second onerand 
is a context-relative 32-bit register. 


Claim 26 


Claim 1 


Apparatus comprising: 




A hardware-based multithreaded 
processor comprising 


A hardware -based multithreaded processor 
comprising: 


A plurality of microengines, 


A plurality of microengines. 


Each of the microengines comprising 


Each of the microengines comprising: 


A context event arbiter, 


Context event switching logic; and 


A controller, 


Controller logic; 


A control store, 


A control store:- 


Local read and write transfer 
registers, 


Karguth column 8, lines 12-17 "...memory interface 
37. for writing results of the operation to parameter 
memory 18 over buses MEMD, MEMA..." and Figure 
3, element 37 - In regards to Karguth, the memory 
interlace must hold, e.g. storing them in registers, the 
data in some way in order to transfer data to and from 
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parameter memory without it incorrectly changing 
and/or influencing the rest of the system. 


Local general purpose registers, and 
an arithmetic logic unit (ALU), 


An execution box data path including an arithmetic 
logic unit (ALU) and a general purpose register set, 


HHf*n at tnf* r\ n iyc\ 1 1 at miprnpnfrmpc 
J_ydVll \Ji. lilt JJiUlcllliy Ul llllvi UCll^lllCo 

including a command that causes the 
ALU to load one or more bytes, 
specified by a multiple-bit field of 
the command, within a destination 
register of a selected microengine 
with a shifted value of another one or 
more bytes of a source register, 


iiit /alu pciiuiiuiiig iui leu oiis in response 10 
instructions, one of the instructions causing the ALU 
to load a destination register with a 32-bit word 
formed by concatenating a first operand and a second 
operand to form a 64-bit result, shifting the 64-bit 
result by a specified amount, and storing a lower 32- 
bits of the 64-bit result. 


The field representing a mask in 
which each bit of the mask identifies 
a different byte of the destination 
register. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
jl<3 . . . , cojuinii lu.iiiits i u-Ho . . . wnie-uacK 

Upt/lculUIl aL-COl till 11^ IU U1C jJIClCliCU CIIlUOUUIlCIll Ul 

the present invention also shifts the output result from 
ALU 30 into the proper position within the result 
word... so that one or more of the byte locations may 
be written with the contents of writeback bus WBBUS 
(the remaining bits being masked from the write)..." 
and Figure 3 


Claim 28 




The apparatus of claim 26, wherein 
the mask is 4-bits. 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
registers are to be selected by the shift/mask units 
28../'; column 10, lines 10-48 "...write-back 
operation according to the preferred embodiment of 
the present invention also shifts the output result from 
alu ju lnio ine pioper position witnin tne result 
VVUILI...SU Liicti one oi liioic oi uie uyie locations may 
be written with the contents of writeback bus WBBUS 
(the remaining bits being masked from the write)..." 
and Figure 3 — In regards to Karguth the exemnlarv 
mask is .3-bits, however, the size of the mask field 
does not matter and it is only an exemplary 
embodiment. 


Claim 29 




The apparatus of claim 26, wherein 
the mask comprises a set bit 
indicating a corresponding byte in 


Karguth column 9, lines 33-38 "Bit position 7:5 and 
15:13 each provide a three-bit code, by way of which 
the desired portion of the destination and source 
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the source register to be loaded. 



registers are to be selected by the shift/mask units 
28..."; column 10, lines 10-48 "...write-back 
operation according to the preferred embodiment of 
the present invention also shifts the output result from 
ALU 30 into the proper position within the result 
word. . .so that one or more of the byte locations may 
be written with the contents of writeback bus WBBUS 
(the remaining bits being masked from the write)..." 
and Figure 3 



Claim Rejections - 35 USC §103 

9. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences bet ween the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

10. Claims 17, 20-21, 23-26, and 28-29 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Karguth, U.S. Patent Number 6,223,277 (herein referred to as Karguth) in 
view of David K. Probst's "Programming, Compiling and Executing Partially-Ordered 
Instruction Streams on Scalable Shared-Memory Multiprocessors" from Proceedings of the 
Twenty-Seventh Annual Hawaii International Conference on System Sciences, 1994 © 1 994 
IEEE (herein referred to as Probst). 

11. Referring to claims 17 and 26, taking claim 17 as exemplary, Karguth has taught a 
processor comprising: 

a. A plurality of microengines (Karguth column 5, lines 12-16 "...another instance 
of network hub and ATM translator 5 would be implemented in place of ATM 
premises switch 8..."; column 3, lines 40-43 "The present invention may be 
implemented in a micro-processor architecture..."; and Figure 1, element 5), 
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b. Each of the microengines comprising 

i. A controller (Kargulh column 7, lines 47-66 ".. .decoded by control and 
instruction decode circuitry 32.. and Figure 3, element 32), 

ii. A control store (Kargulh column 7, lines 47-66 ". . .under the control of 
instructions retrieved from instruction memory 38..." and Figure 3, 
element 32), 

iii. Local read and write transfer registers (Karguth column 8, lines 12-17 
"...memory interface 37, for writing results of the operation to parameter 
memory 18 over buses MEMD, MEM A..." and Figure 3, element 37 - In 
regards to Karguth, the memory interface must hold, e.g. storing them in 
registers, the data in some way in order to transfer data to and from 
parameter memory without it incorrectly changing and/or influencing the 
rest of the system.), 

iv. Local general purpose registers (Karguth column 7, lines 12-29 "...Each 
of the remaining registers REGO through REG30 are general purpose 
registers..." and Figure 3, element 24), and 

c. An arithmetic logic unit (ALU) (Karguth column 7, lines 47-67 "ALU 30 includes 
the appropriate circuitry for executing arithmetic and logical operations upon 
operands presented thereto.. and Figure 3, element 30), 

d. Each of the microengines supporting instructions that perform an ALU operation 
on one or two operands (Karguth column 7. lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and logical operations upon 
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operands presented thereto..."'; column 9, lines 27-32 "...provide a five-bit 
selection code by way of which the destination and source registers, respectively, 
for the instruction are addressed... 55 ; column 9, lines 58 to column 10, line 9 
"...contain an immediate operand value for use... contain a second source 
register..."; and Figure 3; element 30), deposit a result in a destination register 
(Karguth column 8, lines 1-23 "...data results that are to be written back into 
register file 24 are applied to shifter 34, to place the operand in the appropriate bit 
positions of the destination of one of registers.. ."; column 9, lines 27-32 
"...provide a five-bit selection code by way of which the destination and source 
registers, respectively, for the instruction are addressed. . and Figure 3) and 
update ALU condition codes according to the result (Karguth column 11, lines 
35-51 "...the test is evaluated, thus resulting in a singlecycle test and branch 
instruction. . ." - In regards to Karguth, the test results essentially update the 
condition codes, eventhough they are not stored in a register, in order to 
determine if the condition is true or not and to branch accordingly.); and 
e. A local register instruction that loads one or more bytes, specified by a multiple- 
bit field of the instruction, within a local destination register with a shifted value 
of another operand (Karguth column 7. lines 47-67 "ALU 30 includes the 
appropriate circuitry for executing arithmetic and logical operations upon 
operands presented thereto..."'; column 9, lines 27-32 "...provide a five-bit 
selection code by way of which the destination and source registers, respectively, 
for the instruction are addressed... 55 ; column 9, lines 58 to column 10, line 9 
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"...contain an immediate operand value for use... contain a second source 
register..."; and Figure 3, element 30), 
f. The field representing a mask in which each bit of the mask identifies a different 
byte of the destination register (Karguth column 9, lines 33-38 "Bit position 7:5 
and 15:13 each provide a three-bit code, by way of which the desired portion of 
the destination and source registers are to be selected by the shift/mask units 
28..."; column 10, lines 10-48 "...write-back operation according to the preferred 
embodiment of the present invention also shifts the output result from ALU 30 
into the proper position within the result word. . .so that one or more of the byte 
locations may be written with the contents of writeback bus WBBUS (the 
remaining bits being masked from the write)., and Figure 3). 
12. Karguth has not taught a hardware-based multithreaded and a context event arbiter. 
Probst has taught a hardware-based multithreaded (Probst Section 2, paragraph 1 
"Multithreading is commonly suggested as a technique...") and a context event arbiter (Probst 
Section 2, paragraph 4 "...Multithreaded architectures differ in their context switching 
policies..." and Section 2, paragraphs 5-6 "Block multithreaded processors, which switch 
contexts only when a high-latency operation is encountered.. ." - In regards to Probst, the 
mechanism controlling the context switching as taught by Probst is a context event arbiter.). A 
person of ordinary skill in the art at the time the invention was made, and as taught by Probst, 
would have recognized that multithreading improves tolerance of latencies and increases 
processor utilization (Probst Section 2, paragraph 1 "Multithreading is commonly suggested as a 
technique for tolerating latencies and increasing processor utilization..."). Therefore, it would 
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have been obvious to a person of ordinary skill in the art at the time the invention was made to 
incorporate the multithreading of Probst in the device of Karguth to improve latency tolerance 
and increase processor utilization. 

13. Claim 26 has similar limitations to claim 17 and is rejected for similar reasons. Claim 26 
differs from claim 17 only in that claim 26 is for an apparatus while claim 17 is a processor. 

14. Referring to claim 20, Karguth in view of Probst has taught the processor of claim 17, 
wherein the destination register is a general purpose register (Karguth column 8, lines 1-23 
"...data results that are to be written back into register file 24 are applied to shifter 34, to place 
the operand in the appropriate bit positions of the destination of one of registers..."; column 9, 
lines 27-32 ''...provide a five-bit selection code by way of which the destination and source 
registers, respectively, for the instruction are addressed..."; and Figure 3). 

15. Referring to claim 21, Karguth in view of Probst has taught the processor of claim 17, 
wherein the local register instruction comprises the destination register (Karguth column 8, lines 
1-23 . .data results that are to be written back into register file 24 are applied to shifter 34, to 
place the operand in the appropriate bit positions of the destination of one of registers..."; 
column 9, lines 27-32 "...provide a five-bit selection code by way of which the destination and 
source registers, respectively, for the instruction are addressed..."; and Figure 3). 

16. Referring to claims 23 and 28, taking claim 23 as exemplary, Karguth in view of Probst 
has taught the processor of claim 1 7, wherein the mask is 4-bils (Karguth column 9, lines 33-38 
"Bit position 7:5 and 15:13 each provide a three-bit code, by way of which the desired portion of 
the destination and source registers are to be selected by the shift/mask units 28..."; column 10, 
lines 10-48 "...write-back operation according to the preferred embodiment of the present 



Application/Control Number: 09/8 1 1 ,995 Page 20 

Art Unit: 2183 

invention also shifts the output result from ALU 30 into the proper position within the result 
word. . .so that one or more of the byte locations may be written with the contents of writeback 
bus WBBUS (the remaining bits being masked from the write)..." and Figure 3 - In regards to 
Karguth, the exemplary mask is 3-bits, however, the size of the mask field does not matter and it 
is only an exemplary embodiment.). 

17. Claim 28 has similar limitations to claim 23 and is rejected for similar reasons. Claim 28 
differs from claim 23 only in that claim 28 is for an apparatus while claim 23 is a processor. 

18. Referring to claims 24 and 29, taking claim 24 as exemplary, Karguth in view of Probst 
has taught the processor of claim 17, wherein the mask comprises a set bit indicating a 
corresponding byte in the local register to be loaded (Karguth column 9, lines 33-38 "Bit position 
7:5 and 15:13 each provide a three-bit code, by way of which the desired portion of the 
destination and source registers are to be selected by the shift/mask units 28..."; column 10, lines 
10-48 "...write-back operation according to the preferred embodiment of the present invention 
also shifts the output result from ALU 30 into the proper position within the result word.. .so that 
one or more of the byte locations may be wri tten with the contents of writeback bus WBBUS 
(the remaining bits being masked from the write)... 15 and Figure 3). 

19. Claim 29 has similar limitations to claim 24 and is rejected for similar reasons. Claim 29 
differs from claim 24 only in that claim 29 is for an apparatus while claim 24 is a processor. 

20. 

21. Claims 1 8-19 are rejected under 35 U.S.C. 103(a) as being unpatentable over Karguth, 
U.S. Patent Number 6,223,277 (herein referred to as Karguth) in view of David K. Probst's 
"Programming, Compiling and Executing Partially-Ordered Instruction Streams on Scalable 
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Shared-Memory Multiprocessors" from Proceedings of the Twenty-Seventh Annual Hawaii 
International Conference on System Sciences. 1994 ©1994 IEEE (herein referred to as Probst) as 
applied to claim 17 above, and further in view of Vincent P. Heuring and Harry F. Jordan's 
Computer Systems Design and Architecture ©1997 (herein referred to as Heuring). 

22. Referring to claims 18, 19, and 25, Karguth in view of Probst has not explicitly taught 

a. The processor of claim 17, wherein the destination register is an absolute transfer 
register (Applicant's claim 1 8). 

b. The processor of claim 17, wherein the destination register is a context-relative 
transfer register (Applicant's claim 19). 

c. The processor of claim. 17, wherein the local register comprises a context-relative 
source register (Applicant's claim 25). 

23. However, Karguth has taught accessing registers but not specifically how these registers 
are accessed, i.e. that the registers are addressed via absolute transfers or context-relative 
transfers. Heuring has taught 

a. The processor of claim 17, wherein the destination register is an absolute transfer 
register (Applicant's claim 1 8) (Heuring pages 69-71, Table 2.8). 

b. The processor of claim 17, wherein the destination register is a context-relative 
transfer register (Applicant's claim 19) (Heuring pages 69-71, Table 2.8). 

c. The processor of claim 17, wherein the local register comprises a context relative 
source register (Applicant's claim 25) (Heuring pages 69-71, Table 2.8). 

24. A person of ordinary skill in the art at the time the invention was made would have 
recognized that have an addressing method for the registers ensures that the correct registers are 
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accessed when data needs to be retrieved, i.e. read, or written, i.e. stored, in the registers. 
Therefore, it would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to incorporate the register address schemes of Heuring in the device of 
Karguth to ensure the correct registers are accessed. 

Response to Arguments 

25. Applicant's arguments, see Amendment, filed 1 1 January 2007, with respect to the 
rejection(s) of claim(s) 17-21, 23-26, and 28-29 laid forth in the Office Action dated 31 October 
2006 have been fully considered and are persuasive. Therefore, the rejection has been 
withdrawn. However, upon further consideration, a new ground(s) of rejection is made in view 
of the above. 

Conclusion 

26. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

a. Stormon et al., U.S. Patent Number 5,860,085, has taught a load shift instruction 
with a mask that specifies which locations in a memory array to write to. 

b. Huff et al. 5 U.S. Patent Number 6,052,769, has taught a write mask for 
compressed data which indicates which portion of compressed data belongs in 
specific locations. 

27. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Aimee J. Li whose telephone number is (571) 272-4169. The 
examiner can normally be reached on M-T 7:00am-4:30pm. 
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28. If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-4162. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

29. Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 




Aimee J Li ^ 
Examiner 
Art Unit 2183 
25 March 2007 



